Topological Properties of Integer Networks 
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Inspired by Pythagoras's belief that numbers represent the reality, we study that the topological 
properties of networks of composite numbers, in which the vertices represent the numbers and two 
vertices are connected if and only if there exists a divisibility relation between them. The network 
has a fairly large clustering coefficient of « 0.34, which is insensitive to the size of the network. The 
average distance between two nodes is shown to have an upper bound that is independent of the 
size of the network, in contrast to the behavior in small-world and ultra-small-world networks. The 
out-degree distribution is shown to follow a power law behavior of the form k~ 2 . 

PACS numbers: 89.75.Hc, 64.60.Ak, 84.35.-M, 05.40.-a, 05.50+q, 87.18.Sn 
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Many social, biological, and communication systems 
form complex networks, with vertices representing indi- 
viduals or organizations and edges representing the in- 
teractions between them 

Q II Examples are nu- 
merous, including the Internet, the World Wide Web, 
social networks of acquaintance and other relationship 
between individuals, metabolic networks, food webs, etc. 
EH3 IS S El Empirical studies on real- life networks 
reveal some common characteristics different from ran- 
dom networks and regular networks. Among these fea- 
tures, the most noticeable are the small-world effect and 
scale-free property [Tfl ITU Il2| . In this Letter, inspired by 
Pythagoras' belief that numbers represent absolute real- 
ity, we study the topological properties of the most nat- 
ural network in Pythagoras' sense, namely the network 
consisting of integers. 

The distance d between two vertices in a network is 
defined as the number of edges along the shortest path 
connecting them. The average distance L of the network 
is then defined as the mean distance between two vertices, 
averaged over all pairs of vertices. The average distance 
is one of the most important properties in measuring the 
efficiency of communication networks. In a store-forward 
computer network, for example, the most useful measure 
characterizing the performance is the transmission delay 
(or time delay) in sending a message through the network 
from the source to the destination. The time delay is ap- 
proximately proportional to the number of edges that a 
message must pass through. Therefore, the average dis- 
tance plays a significant role in measuring the time delay. 
Other important topological properties include the clus- 
tering coefficient and degree distribution. The degree of 
a vertex x, denoted by k{x), is the number of the edges 
that are attached to the vertex. Through the k(x) edges, 
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there are k(x) vertices that are correlated with or con- 
nected to the vertex x. These neighboring vertices form 
a set of neighbors A{x) of x. The clustering coefficient 
C(x) of the vertex x is the ratio of the number of exist- 
ing edges among all the vertices in the set A(x) to the 
total possible number of edges connecting all vertices in 
A(x). The clustering coefficient C of the entire network 
is the average of C(x) over all x. Empirical studies in- 
dicate that many real-life networks have much smaller 
average distances (with L ~ In S, where S is the number 
of vertices in the network) than a completely regular net- 
works and have much larger clustering coefficients than 
the completely random networks. Studies on real-life net- 
works also show a power-law degree distribution of the 
form p(k) ~ fc~ 7 , where p(k) is the probability density 
function for the degrees and 7 is an exponent. This distri- 
bution falls off slower than an exponential, allowing for a 
few vertices with large degrees. Networks with power-law 
degree distribution are referred to as scale-free networks. 

Here, we investigate a network in which the vertices 
represent a set of positive integers. Two vertices x and y 
are linked by an edge if and only if x is divisible by y or 
y is a factor of x. By definition, the topological structure 
of the network is determined, once the set of vertices 
under consideration is known. For example, Fig.l shows 
the topological structure of a network with the set of 
vertices representing the integers {4, 6, 8, 9, 10, 12, 14, 
15, 16, 18, 20, 21, 22, 24, 25, 26, 27, 28, 30}. 

Formally 13], a network is represented by a graph 
G(V, E), where V is the set of vertices and E is the set 
of edges. A graph G is said to be connected if any two 
vertices of G are connected, i.e., one can go from one 
vertex to another through the edges in the network. If a 
graph G is not connected, it consists of several disjoint 
components. The example in Fig.l has five components: 
VI ={4, 6, 8, 9,10, 12, 14, 15, 16, 18, 20, 24, 27, 28, 30}, 
V2 ={21}, V3 ={22}, VA ={25} and V5 ={26}. Al- 
ir etworks are not always connected, most 
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FIG. 1: The topological structure of an integer network with 
the set of vertices representing the integers V ={4, 6, 8, 9, 
10, 12, 14, 15, 16, 18, 20, 21, 22, 24, 25, 26, 27, 28, 30}. The 
corresponding graph consists of 19 vertices and 19 edges, and 
shows 5 disjoint components. The largest component has 15 
vertices and 19 edges, which is denoted by G30. 



previous studies have focused on connected graphs. In 
the analysis of the network of integers, we only keep the 
largest connected component if the given set of integers 
gives a disconnected graph. With this choice, the sets of 
vertices {4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, 21, 22, 24, 
27, 28, 30} and {4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, 24, 
27, 28, 30}, for example, will generate the same graph 
consisting of 15 vertices and 19 edges. 

Different sets of vertices (positive integers) will, in gen- 
eral, lead to graphs of different properties. For instance, 
the set of integers {1, 2, 4, • • • , 2"} generates a complete 
graph, and the set consisting of all the prime numbers 
and 1 leads to an infinite star graph. Here we focus on 
a special class of networks of integers in which the ver- 
tices are composite numbers. A composite number is a 
positive integer greater than 1 and is not prime. We use 
the symbol Gn to denote the network generated by the 
set of composite numbers less than or equal to A, i.e., 
Vn = {x\x G Q, 4 < x < A}, where Q is the set of all the 
composite numbers. For example, the largest component 
of the network in Fig. 1 is the graph G30 . Note that the 
number of vertices in Gat grows with A. We have calcu- 
lated the clustering coefficients G in the networks Gat up 
to A = 20, 000 and found that the clustering coefficients 
only fluctuate very slightly about the value of 0.34 with 
A (see inset of Fig. 2). 

Figure 2 also shows the dependence of the average dis- 
tance L in the networks Gn for A up to 20,000. Af- 
ter a range of A less than 5,000 for which L increases 
from about 1.8 to 2.4, the average distance increases 
fairly slowly with A for A > 5,000. Note that the av- 
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FIG. 2: The average distance L in the network Gn for net- 
work size up to TV = 20, 000. For TV > 5, 000, L increases only 
slowly with TV. Analytically, there exists an upper bound for 
L that is independent of TV. The inset shows the clustering 
coefficient C, which fluctuates only slightly about the value 
0.34 as TV increases. 

erage distance L ~ In A in small- world networks [lOj and 
L ~ lnlnA in ultrasmall- world networks Here, we 
show that the diameter [l5j D of Gn must be smaller 
than a constant M that is independent of network size 
characterized by A, i.e., L < M in Gn for arbitrary A, 
where M is a constant. For any vertex x £ Gn, we first 
prove that the distance between x and the smallest in- 
teger 4 in Gn satisfies d(x,4) < 4. Since x G Q, it can 
be written as x = p\P2 • • -Pg, where p\ > P2 > ■ ■ ■ > p q 
are prime numbers. If q > 4, then xjp\p2 G Q, thus 
x/piP2 G A(x). Noting that Ax/piP2 < x < A, there 
exists a path of length 3 from a; to 4 through the vertices 
x/piP2 and 4x/piP2- If 3 = 3 and p\ > 4, analogously, 
a path through the vertices x/pi and 4x/pi exists be- 
tween x to 4. If q — 3 and p\ < 4, then x can only be 
x= 8, 12, 18, and 27, for which 8 and 12 are connected 
directly to 4. For A > 107, there exists a path from 
a; to 4 through Ax. Therefore, for any vertex x with at 
least 3 prime factors, d(x, 4) < 3. Finally, for q = 2, 
since A(x) ^ 0, y — 2x must be in A(x). Then y has at 
least 3 prime factors: pi, P2 and 2, leading to d(y, 4) < 3 
and thus d(x, 4) < 4. Therefore, for any two vertices 
a,b G Gat, we have d{a, b) < d(a,4) + d(b, 4) < 8. This 
implies D < 8. Since the average distance L < D, the 
networks Gn have a constant upper bound for L. The 
networks Gn are, thus, distinguished from the other net- 
works by having a large clustering coefficient C (« 0.34) 
and a constant upper bound to the average distance L. 

Figure 3 shows the degree distributions of the network 
Gioooooo- The degree of a vertex x is the sum of the in- 
degree and out-degree: k{x) = ki n {x) + k out (x), where 
the in-degree of x is defined as the number of elements 
in the set A(x) that are factors of x, and the out-degree 
of x is the number of elements in A{x) that are divisible 
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FIG. 3: The (a) in-degree and (b) out-degree distributions 
of the network Gioooooo • The y — axis gives the total number 
of nodes having degree k. The line in (b) shows that the out- 
degree distribution follows a power law of the form ~ k~ 2 . 
The total degree (with in and out degrees counted together) 
distribution is shown in (c). 



by x. Figure 3(a) and (b) show the in-degree and out- 



degree distributions. Note that the range of in-degrees 
is smaller than that of out-degrees in the network G]y. 
The in-degree distribution does not show any power law 
behavior. For the out-degrees, k out {x) — [^J — 1. This 
leads to P(k out ) ~ ( k+1 f( k+2 ) ~ k~ 2 , for a range of k 
larger than unity. This behavior is shown by the straight 
line in Fig. 3(b). The behavior for very large k is limited 
by the size of the network characterized by N. Counting 
the in-degree and out-degree together, Fig. 3(c) shows the 
total degree distribution. 

In summary, we proposed and studied a kind of net- 
works that can be regarded as the most natural, namely 
networks consisting of sets of composite numbers as ver- 
tices. Such networks exhibit interesting features that are 
different from many previously studied real-life networks. 
Most noticeably, these networks show a fairly large clus- 
tering coefficient. The average distance between two ver- 
tices is shown to have an upper bound that is independent 
of the network size. The out-degree distribution follows 
a power law. The structure of such networks may be use- 
ful, e.g., in communications, due to its special topological 
properties. Mathematical objects, although unintention- 
ally defined to be so at the beginning, very often have 
close connections to the physical world. It is, therefore, 
worth investigating the secret of other networks of math- 
ematical objects such as those consisting of groups and 
rings as vertices. 
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